Self-Adaptive DNN for Improving Spoken Language Proficiency Assessment
نویسندگان
چکیده
Automated assessment of language proficiency of a test taker’s spoken response regarding its content, vocabulary, grammar and context depends largely upon how well the input speech can be recognized. While state-of-the-art, deep neural net based acoustic models have significantly improved the recognition performance of native speaker’s speech, good recognition is still challenging when the input speech consists of non-native spontaneous utterances. In this paper, we investigate how to train a DNN based ASR with a fairly large non-native English corpus and make it self-adaptive to a test speaker and a new task, namely a simulated conversation, which is different from them monologic speech in the training data. Automated assessment of language proficiency is evaluated according to both task completion (TC) and pragmatic competence (PC) rubrics. Experimental results show that self-adaptive DNNs trained with i-vectors can reduce absolute word error rate by 11.7% and deliver more accurate recognized word sequences for language proficiency assessment. Also, the recognition accuracy gain translates into a gain of automatic assessment performance on the test data. The correlations between automated scoring and expert scoring could be increased by 0.07 (TC) and 0.15 (PC), respectively.
منابع مشابه
Using deep neural networks to improve proficiency assessment for children English language learners
We investigated the use of context-dependent deep neural network hidden Markov models, or CD-DNN-HMMs, to improve speech recognition performance for a better assessment of children English language learners (ELLs). The ELL data used in the present study was obtained from a large language assessment project administered in schools in a U.S. state. Our DNN-based speech recognition system, built u...
متن کاملBidirectional LSTM-RNN for Improving Automated Assessment of Non-Native Children's Speech
Recent advances in ASR and spoken language processing have led to improved systems for automated assessment for spoken language. However, it is still challenging for automated scoring systems to achieve high performance in terms of the agreement with human experts when applied to non-native children’s spontaneous speech. The subpar performance is mainly caused by the relatively low recognition ...
متن کاملIncorporating Uncertainty into Deep Learning for Spoken Language Assessment
There is a growing demand for automatic assessment of spoken English proficiency. These systems need to handle large variations in input data owing to the wide range of candidate skill levels and L1s, and errors from ASR. Some candidates will be a poor match to the training data set, undermining the validity of the predicted grade. For high stakes tests it is essential for such systems not only...
متن کاملThe Relationship between EFL Learners’ Use of Language Learning Strategies and Self-Perceived Language Proficiency
The present study was conducted to investigate whether there was a relationship between EFL learners’ use of language learning strategies and their self-perceived language proficiency at the two levels of intermediate and advanced. A total of 67 subjects (39 intermediate-level and 28 advanced) were selected to participate in this study based on their scores on a piloted language proficiency tes...
متن کاملPrompt-based Content Scoring for Automated Spoken Language Assessment
This paper investigates the use of promptbased content features for the automated assessment of spontaneous speech in a spoken language proficiency assessment. The results show that single highest performing promptbased content feature measures the number of unique lexical types that overlap with the listening materials and are not contained in either the reading materials or a sample response,...
متن کامل